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ABSTRACT 

Aims. We investigate the performance of grid-based techniques in estimating the age of stars in detached eclipsing binary systems. 
We evaluate the precision of the estimates due to the uncertainty in the observational constraints, M ii2 , T e //,i, 2 , [Ee///]^, and the 
systematic bias caused by the uncertainty in convective core overshooting, element diffusion, mixing-length value, and initial helium 
content. 

Methods. We adopted the SCEPtER grid, which includes stars with mass in the range [0.8; 1.6] M e and evolutionary stages from the 
zero-age main sequence to the central hydrogen depletion. Age estimates have been obtained by a generalisation of the maximum 
likelihood technique described in our previous work. 

Results. We showed that the typical lcr random error in age estimates - due only to the uncertainty affecting the observational 
constraints - is about ±7%, which is nearly independent of the masses of the two stars. However, such an error strongly depends 
on the evolutionary phase and becomes larger and asymmetric for stars near the ZAMS where it ranges from about +90% to -25%. 
The systematic bias due to the including convective core overshooting - for mild and strong overshooting scenarios - is about 50% 
and 120% of the error due to observational uncertainties. A variation of ±1 in the helium-to-metal enrichment ratio AF/AZ accounts 
for about ±150% of the random error. The neglect of microscopic diffusion accounts for a bias of about 60% of the error due to 
observational uncertainties. We also introduced a statistical test of the expected difference in the recovered age of two coeval stars 
in a binary system. We find that random fluctuations within the current observational uncertainties can lead genuine coeval binary 
components to appear to be non-coeval with a difference in age as high as 60%. 
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1. Introduction 


The possibility of obtaining direct measurements of stellar mass 
and radius makes eclipsing bi nary systems the objects of funda - 
mental importance (see e.g. lAndersenlll99ll : lTorres et alJ l2010). 
Nowadays, for stars in double-lined systems, these quantities can 
be determin ed with precisions of a few percentage points or bet¬ 
ter (see e.g. lClausen et alJl2008l : IPavlovski et aUl2014h . Thus, 
detached eclipsing binaries offer a unique test bed for evolution¬ 
ary models of single stars, giving the opportunity of obtaining 
accurate estimates of their ages. 


However, this age estimation is usually performed by fit¬ 
ting the observed data with isoc hrones or stellar evolu tionary 
tracks (see among many others I South worth et al.il2011l: Torres 


et al. l2012l : lVos et alJ2012hIPavlovski et al.ll2014l) . Recently, dif¬ 
ferent techniques have been established for single stars, based 
upon a maximum lik elihood estimation over a grid o f pre¬ 
computed modelsje^ Gai_eta]2j20 l]jBasu_e^_aL \2012 .Gen¬ 
ii am et al. 20121: Mathur et alJl2012l: Prada Moroni et al. 2012; 


IValle et ah 1201 4L 20151) . Such methods are very fast and flexible 

since they can simultaneously take all the available observables 
into account, reaching high precision in the estimates and self- 
consistently evaluating their errors. 


Send offprint requests to: G. Valle, valle@df.unipi.it 


Despite these appealing features, very few efforts can be 
found in the literature that address the performance of these 
techniques for binary systems and that investigate the possible 
sources of their internal bias. Two studies that address , amo ng 
other topic s, the problem of bina ry ages are by iGennaro et al.1 
rt20l2h and [Schneider e t al] (120141) . The former is a Bayesian in¬ 
vestigation of the age reconstruction of pre-main sequence stars, 
which also considers binary systems. The authors adopted nine 
synthetic test systems, subjected each of them to 100 Monte 
Carlo perturbations, and evaluated the fraction of fake non¬ 
coeval estimates at lcr level. The results indicated a fraction of 
erroneous non-coevality from 0% to 95% with a mean of 13%. 
Given the restricted sample of explored systems, iGennaro et al] 
( 2012 ) recognised the importance of a full analysis of the ex¬ 
pected differences in age estimates in different ranges of mass, 
metallicity, and evolutionary phases. The last was a study by 
ISchneider et al . ( 120141) . who tested a Bayesian technique for age 
estimation and, st other tasks, compared the inferred stellar ages 
for the primary and secondary stars of eclipsing binaries in the 
mass range between 4.5 and 28 M 0 . In a sample of 18 binary 
systems, they found agreement at 1 cr level in the estimated ages 
of 17 of them. They also evaluated the mean difference in age 
of 14 systems (systems with mass ratio higher than 0.97 were 
excluded) to be 0.9 + 2.3 Myr at the 95% confidence level. 

The aim of this paper is to theoretically analyse the accu¬ 
racy of grid-based age estimates for detached eclipsing binary 
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systems. We also study the bias on these estimates due to the 
uncertainties in some physical mechanisms adopted in the mod¬ 
els (convective core overshooting and element microscopic dif¬ 
fusion efficiencies, mixing-length value), and in the stellar chem¬ 
ical composition (mainly the initial helium content). 

Finally, we introduce a statistical test of the expected differ¬ 
ence in the reconstructed ages of two coeval binary components 
simply due to the observational uncertainties. Such a test is of 
interest because the recovered non-coevality of stars in binary 
systems - i.e. the inability to fit both components with a single 
isochrone - is often used to claim some deficiency in current 
generation of stellar models. This sometimes leads to introduc¬ 
ing and/or calibrating some physical processes in evolutionary 
codes, such as convective core overshooting, or to varying the 


eter (see e.g. Andersen 1991; Pols et al. 19971; Ribas et al. 

2000 

Torres et al. 

2006; Morales et al. 2009; Clausen et al. 2009, 

2010 

Torres et al 

2010, 20141 and references therein). 


We restrict our analysis to central hydrogen-burning stars 
with mass in the range from 0.8 to 1.6 M Q . A similar investi¬ 
gation has already been performed dValle et al.il2015l) for the 
age determination of isolated single stars for which the main ob¬ 
servables adopted in the reconstruction were different from the 
present ones. In that case neither the mass nor the radius were 
available, while in the present study we do not rely on the aster- 
oseismic observables. As discussed in the following, the perfor¬ 
mances and the systematic biases of grid-based recovery tech¬ 
niques sensitively depend on the specific set of observables ac¬ 
tually adopted in the recovery procedure. 

The structure of the paper is the following. In Sect. [2] we dis¬ 
cuss the method and the grids used in the estimation process. The 
main results are presented in Sects. [3]and[4] In Sect.[5]we present 
a statistical test of the expected differences between the single¬ 
star age estimates. Some concluding remarks can be found in 
Sect. [6] The Appendix [A] describes the sampling strategy fol¬ 
lowed to build the synthetic dataset. 

2. Grid-based recovery technique 

The basic technique, derived from lBasu et alJ (2012 )_and suitable 
for is olated star estimation, is described in IValle et ail d2014l 
1201 5h . hereafter V14 and V15. Age estimates are based upon 
a modified SCEPtER (Stellar CharactEristics Pisa Estimation 
gRid) scheme, adapted for binary stars. The code and the stan¬ 
dard grid developed for this work are available in the R package 
SCEPtERbinary Q on CRAN. 

The adopted implementation assumes that >Si and 
S 2 are detached binary system stars for which the 
following vectors of observed quantities are avail¬ 
able: q Sl1 = {r eff , 5l2 ,[Fe/H] 5l2 ,M 5l2 ,R 5l2 }. We let 

o- 1 ’ 2 = {o-(r eff , 5l2 ),cr([Fe/H] 5l2 ),o-(M 5l2 ),cr(R 5l2 )) be the 

nominal uncertainty in the observed quantities. For each 
point j on the estimation grid of stellar models, we define 
q j = |Fe-ll|,A/,.A’ ; 

likelihood functions defined as 
r 4 . \ 


Let j be the single-star 


-C 1 ’ 2 ; = 


1 


(0 vSc 


.1,2 


x exp 


y 2 ' 
A 12 


(1) 


where 


Ti , 2 


2 




( 2 ) 


http://CRAN.R-proj ect.org/package=SCEPtERbinary 


The single-star likelihood functions are independently eval¬ 
uated for the two stars for each grid point within 3<x of all the 
variables from <Si,2; let X 1,2 max be the two maximum values ob¬ 
tained in this step. The single-star ages are estimated by averag¬ 
ing the corresponding quantity of all the models with likelihood 
greater than 0.95 x XmL- Informative priors can be inserted as a 
multiplicative factor in Eq. (UK as a weight attached to the grid 
points. 

In the following we assume, implicitly or explicitly, that the 
stars in the binary system are coeval. This assumption justifies 
the adoption of a single age as representative of the whole sys¬ 
tem. A first possible way to estimate the age of the binary system 
is simply to take the mean of the ages of the single components, 
computed independently. This method has a computational com¬ 
plexity of 0{n\ +n j), where nip. are the sizes of the 3cr grid-point 
samples. 

A second approach explicitly assumes in the likelihood com¬ 
putation that the stars are coeval and computes the joint likeli¬ 
hood function only for the couples of models in the two 3<x boxes 
with ages within 10 Myr. The joint likelihood is computed as the 
product of the single-star likelihood functions. Let Xmax be the 
maximum value obtained in this step. The joint-star estimated 
age is obtained by averaging the corresponding quantity of all the 
couples of models with likelihood greater than 0.95 x Xmax- This 
technique has a computational complexity of 0(n 1 x 112 ). Read¬ 
ers interested in the technical solutions implemented are referred 
to the code provided in the above-mentioned SCEPtERbinary R 
package. This estimation method is assumed as our standard in 
the following. 

The described technique is similar to those previously em¬ 
ployed in the literature in the framework of isochrone x 2 fit¬ 
ting for binary_age estimations (see, amongst many, [Pols et ali 
119971: lLastennet & Valls-Gabaudl 12002 ). The main differences 
are that the quoted x 2 approaches adopt a time-consuming nu¬ 
merical functional minimisation as the best model estimates and 
use x 2 profiles to infer the errors on the estimated parameters, 
often fixing the value of the masses to their observed values. 
However these met hods can lead to possibly severe error u nder- 
estimation (see e.g. Bas u et aTIteOlOtlOuirion et al.ll2010h . The 
technique adopted in this a nd similar grid-based approaches (e.g. 
iGai et alJl201 lfclBasu et al. 2012) computes as best estimate a lo¬ 
cal mean of the best-matching models, as described above. The 
error on the estimates are then computed by a Monte Carlo simu¬ 
lation. This approach allows a confidence interval to be provided 
for the age estimate and observational parameter correlations to 
be considered that are not addressed in y 2 profiles (see e.g. Las- 
tennet & Valls-Gabaud 20021 : Basu et alj|2010h . ~ 


Although in the current paper we are not interested in ob¬ 
taining a statistical confidence interval for the age of observed 
systems, the adopted method can be easily used to this purpose. 
To address this point, we reside on a generation of a synthetic 
sample of n binary systems, starting from the observed values 
and following, for each star, a multivariate normal distribution 
with vector of mean {g 5 ', q Sl ] and covariance matrix £. For the 
Monte Carlo simulations, a value of n = 10000 can be adopted 
since it provides a fair balance between computation time and 
accuracy of the result^. The median of the age of the n systems, 
obtained with one of the two methods described above, is taken 
as the best estimate of the true values; the 16th and 84th quantiles 
of the n values are adopted as a lcr confidence interval. 


2 The chosen value allows a mean relative accuracy of 0.1% to be 
reached on the lcr confidence interval. 
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2.1. Standard stellar model grid 


The standard estimation grid of ste llar models is obtained us- 
ing FRANEC stellar evolution code dDegl’Innocenti et~ap2008l : 
iTognelli et al.ll20llh . in the same configuration as adopted to 
comp ute the Pisa Stellar Evolution Data Basj^l for low-mass 
stars dDeirOmodarme et al.l 2(712 ; IPelTOmodarme & Vallel 
120131) . 


The grid consists of 141 680 points (110 points for 1 288 evo¬ 
lutionary tracks), corresponding to evolutionary stages from the 
ZAMS to the central hydrogen depletiorQ. Models are computed 
for masses in the range [0.80; 1.60] M 0 with a step of 0.01 M 0 . 
The initial metallicity [Fe/H] is assumed in the range [-0.55; 
0.55] with a step of 0.05 dex. The solar scaled heavy-element 
mixture by lAsplund et al.l d2009t) is adopted. The initial helium 
abundance is obtained using the linear relation Y — Y p + j-Z 
with the DrimordialabundanceTjj^^Oj^TSSfrom WMAP (Cy- 
burt et al. 12004 ; ISteigmanl2006t Peimbertet al.l2007allfc ), and as¬ 
suming AY/AZ — 2 ( Page! & Portinaril 19981 : Jimenez et al.l 2003 ; 


iGennaro et al.ll2010h . The models are computed by assuming the 

solar-scaled mixing-length parameter a m \ = 1.74. The convec¬ 
tive core overshooting extension is not considered. Further de- 
tails on the input adopte d in the computations are available in 
IValle et al.l d2009ll20T3allhh . 


3. Binary system age estimates: internal accuracy 

To evaluate the internal accuracy of grid-based age estimates 
of binary systems, we analysed the ideal case in which stellar 
models are in perfect agreement with stars. This is the most 
favourable case where the error in the estimated ages arises only 
from the uncertainties affecting the observational constraints 
used in the recovery procedure. 

To achieve such an ideal situation, the first age-estimation 
test was performed on a synthetic dataset obtained by sampling 
N - 50 000 artificial binary systems from the same standard es¬ 
timation grid of stellar models used in the recovery procedure. 
To simulate the effect of observational uncertainties, we added 
a Gaussian noise in all the observed quantities. We assumed 
as standard deviations 100 K in 7' c ii, 0.1 dex in [Fe/H], 1% in 
mass, and 0.5% in radius. Since the observationally inferred val¬ 
ues of a given physical quantity for the two binary components 
are correlated, correlation coefficients should be used in the co- 
variance matrix Z whenever a realistic noise has to be simulated. 
In our standard scenario, we assumed a correlation of 0.95 be¬ 
tween the primary and secondary effective temperatures, 0.95 
between the metallicities, 0.8 between the masses, and no cor¬ 
relation between the radii. Motivations of these choices and an 
analysis of the impact on the results of different assumptions of 
correlations amongst stellar quantities, radii included, are pre¬ 
sented in Sect. 13.11 

Figure Q] shows the Monte Carlo relative error in estimated 
age as a function of the mass and the relative age of the primary 
star. In this and in the following analogous figures, a positive 
value of the age relative error corresponds to an overestimated 
age. The relative age is defined as the ratio between the age of 
the star and the age of the same star at central hydrogen exhaus¬ 
tion (the age is conventionally set to 0 at the ZAMS position). It 
shows the position of the median relative error and the position 
of the lcr envelope of age-relative error as a function of mass 

3 http://astro.df.unipi.it/stellar-models/ 

4 The ZAMS and the central hydrogen depletion models are defined as 
the models whose central hydrogen abundance drops below 99% of the 
initial value and 1CT 30 , respectively. 


and relative age. The envelope is obtained as in V15, computing 
the 16th and 84th quantiles of the relative errors over a moving 
window in mass and relative ageQ 

Table Q] reports the position of the envelope as a function of 
the mass of the two stars, of their relative age, and of the mass 
ratio q of the systerrQ Table Q] and Fig. Q] show that the typical 
relative age error as a function of the primary star mass is about 
7%. The envelope is larger for models near the ZAMS, where 
the envelope is highly asymmetric and - for the primary star - 
it ranges from -23% to 93%, with a clear edge-effect distortion 
(see the extensive discussion in V14 and V15). For relative ages 
of the primary star larger than 0.4 the precision is always bet¬ 
ter than 12%. This accuracy is about four times better than is 
attainable by adopting asteroseismic constraints (i.e. the aver¬ 
age large frequency spacing Av and the frequency of maximum 
oscillation power v max ), without the knowledge of mass and ra¬ 
dius of the stars (V15 and references therein). Moreover, in V15 
we showed that the systematic in age estimates due to different 
stellar codes was of the same order of magnitude as the random 
errors. Since the random component is significantly smaller for 
binary systems, we expect the systematic bias due to the adop¬ 
tion of different grids of stellar models to play a major role in the 
age determinations. 

Several resulting features may depend in principle on the 
technique employed to obtain the system age and on the adopted 
perturbation strategy, so we devoted Sections l3. 1 | and [3.2l to dis¬ 
cussing the influence of these possible bias sources. A detailed 
discussion of the sampling strategy adopted to build the synthetic 
dataset and an analysis of its effect on the obtained results can 
be found in Appendix lAl 


3. 1. Effect of correlations 


To simulate the observational uncertainties (see Sect. 0. we 
added a Gaussian perturbation to the four observables (i.e. M, 
R , 7/fr, [Fe/H]) of the stars in the previously generated synthetic 
dataset of artificial binary systems. In the standard scenario they 
are sampled from a multivariate normal distribution assuming 
correlation of p = 0.95 between the two effective temperatures, 
p = 0.95 between the two metallicities, p = 0.8 between the two 
masses, and zero between the two radii. These correlations arise 
from the fact that the analysis of data for binary stars usually 
determines some function of the observables at higher accuracy 
than the observables themselves. 

The strongest correlation is expected between the effective 
temperatures of the two stars, since the difference in or the ra¬ 
tio of the effective temperatures in an eclipsing system is usu¬ 
ally determined with relative accuracy that is about three times 
greater than that of the individual effective temperatures them- 
selvestseeeu*. QMe^2003ySoudiworth - &Clausen|2007^South- 


worth l2013i ; lTorres et al.1120141) . A direct computation, by means 
of error propagation under normality assumption, on the data 
presented in the cited literature showed that the correlation of 
the two effective temperatures is reproduced well when assum¬ 
ing p = 0.95. 

Regarding [Fe/H], we adopted a high correlation coefficient 
(p = 0.95) since usually only a mean value for the entire system 


5 The half width of the windows is chosen to maintain the error on the 
lcr mass envelope due to Monte Carlo sampling at a level of about 0.2%, 
without introducing too much smoothing. The corresponding error on 
the 2cr envelope - relative to 2.5th and 97.5th quantiles - is about 1%. 

6 As usual, q is defined as the ratio between the masses of the sec¬ 
ondary and the primary stars. 
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Fig. 1. Left: age relative error in dependence on the mass of the primary star with its lcr envelope (red solid line). The blue long dashed line marks 
the relative errors median. Right: same as the left panel , but in dependence on the relative age of the primary star. 


is presented in the analyses. For mass determination, the mass 
ratio q is usually determined with relative accuracy of about 1.6 
higher than the mean one of the masses themselves (see e.g. Pop¬ 
per et al.ll986l;lLacv et al.l 2008t Torres et al.l 12009 ^Brogaard 
et al. 120111 : 1 Vo s et al.ll2012 : TSandauist et al.ll2013l) . Assuming 
normality, this corresponds to a correlation of p — 0.8 between 
the two star masses. However, even significantly lower values 
for t he correlation (p 0,15) resulted from lite rature examina¬ 
tion {Hetminiak et al. 2009; Meibo m et al JI20091) . 


The situation is much more complex in the case of radii 
determination. The light curve analysis poses independent con¬ 
straints on the sum of radii r s and on their ratio k , causing a de¬ 
pendence between the two evaluated stellar radii; however, the 
magnitude and the sign of the correlation between the estimated 
radii depend on the actual values of r s and k and of their er¬ 
rors. Direct computations from the values quoted in the literature 
showed correlations varying from p — 0.8 t o 0 = -0,5 ( Ponner 
et al. 119861: iGrundahl et al .l 120081: [Tor res et al.l 120091: Brogaard 
et al. l201 lb I Vos et al.ll2012l) . Luckily, such high variability is not 
a serious problem because, as shown below, the actual magnitude 
of the correlation between radii has very little impact on the re¬ 
sults. Thus, for the reference scenario we assumed uncorrelated 
radii. 


To quantify the impact on the age estimate uncertainty of 
accounting for the aforementioned correlations in the procedure 
followed to build the synthetic datasets, we applied the SCEPtER 
pipeline on datasets produced by relying on different assump¬ 
tions about correlations. As a first test, we provided a dataset 
of artificial binary stars with uncorrelated observables; i.e., the 
Gaussian noise was added assuming a diagonal covariance ma¬ 
trix. Although such a choice is quite extreme and not realistic, 
the comparison between the outcomes of the age estimate proce¬ 
dure on this dataset with those on the standard one will quantify 
the maximum effect of the correlations. Figure 0 shows the im¬ 
pact of the correlation between observables on the relative errors 
in the estimated age of the system. The neglect of correlation 
on the corresponding observables of the two stars causes a mean 
relative shrinkage of the envelope of about 20%. Direct simula¬ 


tions (not shown) proved that the shrinkage is due to dropping 
the correlation between the two masses, while the correlations 
between the effective temperatures and between the metallicities 
account for modest variations. 

As a second test, we built two synthetic datasets by assuming 
the same correlations of the standard case, with the exception of 
the correlation coefficient between the radii of the binary compo¬ 
nents, namely p = +0.7. The comparison of the performances of 
the recovery procedure on these datasets with those on the stan¬ 
dard one will assess the impact of the correlation amongst the 
radii. Table [2] reports, as a function of the mass of the primary 
star, the results obtained accounting for correlations between the 
radii. The mean variations on the envelope boundaries are about 
0.2%, so it is safe to neglect the correlation among radii. 

3.2. Recovery algorithms 

Figure[2]also shows the impact of the adopted age-recovery tech¬ 
nique. The simple average of the individually independent age 
estimates of the two stars appears to be as good as the joint es¬ 
timate in the case of a primary star of low mass (M <1.1 Af 0 ), 
while it shows a notable larger variance in binary systems com¬ 
posed of a high-mass primary and a low-mass secondary. For 
these combinations of objects, the secondary stars is sampled in 
the first stages of the evolution, since the maximum time is set by 
the faster evolution of the more massive star. The age estimate 
of a star in the early evolutionary stages is subject to larger rel¬ 
ative uncertainty, and this uncertainty is propagated in the final 
mean. Such an effect is not present in the joint evaluation, since 
in this case the age is mainly determined by the more evolved 
star. In fact the age spread of the models in the 3cr box around 
the primary star is generally lower than the spread the secondary 
star, since the primary is sampled at a higher relative age than 
the secondary. Therefore the joint likelihood estimate is limited 
by the range of age spanned by the primary star. 

The increase in the variance in age estimates when coevality 
is not explicitly assumed is shown further in Fig. [3] The top row 
of the figure shows, as a function of the masses M\ and M 2 of 
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Table 1. SCEPtER median ( p 50 ) and lcr envelope boundaries (p 16 and p 84 ) for age relative error as a function of the masses, relative ages, and 
mass ratio of the stars. Values are expressed as percent. 
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Pm 
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4.9 

4.3 

3.9 






Mass ratio q 
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Table 2. Median (p 5 o) and lcr envelope boundaries (p 16 and p S4 ) for age relative error as a function of the mass of the primary star, assuming 
correlations p = ±0.7 between the radii. 
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the binary system components, the position of the 2D lower (left 
panel) and upper (right panel) lcr envelope^ To summarise the 
results, the figure reports the mean value (expressed as percent) 
of the envelope boundaries in three mass ranges: M\ < 1.1 M 0 
and Mi < 1.1 M Q ; M\ > 1.1 M Q and M 2 > 1.1 M Q ; M\ > 1.1 
M 0 and Mi <1.1 M Q . The lower row of the figure displays the 
ratio of the boundaries for joint likelihood and independent age 


7 The bidimensional envelopes are computed with a similar technique 
to the ID ones, with mass steps of 0.1 M 0 and moving-window half 
width of 0.08 M 0 . The mass binning choice is such that the envelope is 
affected by a typical random uncertainty of about 0.5%. 


estimates. It is apparent that the two age-estimation techniques 
give nearly equivalent results as long as the system stars are close 
to each other in mass (q =s 1), while for unbalanced systems, the 
ratio of the boundary is as low as 0.2 for M\ = 1.6 M Q , Mi = 0.8 
M 0 , which is, an increase in the envelope width of a factor of 
five. 

3.3. Contribution of the secondary star to age estimates 

In Sect. l3.2l we discussed the dominant impact of the primary star 
on joint likelihood age determination. It is therefore interesting 
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Fig. 2. Upper row: lcr envelope of the age relative error in dependence on the mass of the primary star (left panel), and on its relative age (right 
panel). The solid black line corresponds to accounting for correlations in covariance matrix (see text) and assumes that the stars are coeval; the 
red solid line assumes the same covariance matrix without assumption of coeval stars in the recovery (see Sect. H; the dashed line assumes coeval 
stars and a diagonal covariance matrix. Lower row: same as the upper row, but for the secondary star. 


to explore the contribution of the secondary star to the whole 
system age determination. We performed this test by compar¬ 
ing the envelopes for the joint likelihood age estimate and for 
the primary single-star estimates. The results are presented in 
Fig. 01 Computed as a function of the primary mass, the joint 
estimate’s relative error envelope shows negligible differences 
for both the upper and the lower boundaries with respect to the 
envelope computed only for primary stars. The contribution of 
the secondary star is only visible for young systems, where the 
age of the primary star is less constrained. In fact, the right-hand 
panel of the figure shows that the relative age envelope that takes 
the secondary contribution into account is less prone to age un¬ 
derestimation at a relative age lower than 0.15. In summary, the 
secondary star plays a minor role with respect to the primary one 


because its age is generally estimated with a larger uncertainty 
being closer to the ZAMS than the more massive companion of 
the same age. 

4. Stellar model uncertainty propagation 

The accuracy of age estimates of real binary systems obviously 
depends on the reliability of the adopted grid of stellar models. 
In this section, following V14 and V15, we provide an analy¬ 
sis of the impact of the various uncertainty sources that affect 
the stellar computations. We focus here on some issues consid¬ 
ered in V15: the convective core overshooting efficiency, the ini¬ 
tial helium content, and the neglect of the microscopic diffusion. 
We constructed several artificial grids characterised by the vari- 
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Fig. 3. Upper row: left lower boundary of the lcr 2D relative error envelope for joint likelihood estimates as a function of the mass of the binary 
system stars. The percentages on the plot refer to mean values in different ranges of mass (see text). Right: same as the left panel, but for the 
upper boundary of the 2D relative error envelope. Lower row: left ratio of lower boundary of the lcr 2D relative error envelope due to different 
recovery techniques. The ratio is computed by dividing the position of the joint likelihood estimates boundary by the corresponding value obtained 
averaging the independently estimated ages. Right: same as the left panel, but for the upper boundary of the 2D relative error envelope. 


ation in the single input under examination up to its extreme al¬ 
lowed values, while keeping all the others fixed to their refer¬ 
ence values. We then built the corresponding synthetic datasets 
by sampling artificial binary systems from these non-standard 
grids of stellar models and by adding a Gaussian noise in all the 
observed quantities to simulate observational uncertainties fol¬ 
lowing the same prescriptions for covariance matrix described 
in the standard case. Finally, the ages are estimated by means of 
the SCEPtER pipeline by relying on the standard grid of stellar 
models. 

Although the use of a solar-calibrated mixing-length value 
for stars that differ from the Sun for mass, composition, or evolu¬ 
tionary phase could be inappropriate in the present paper; unlike 
V15, we do not consider this source of uncertainty. Such a choice 
is motivated by the fact that the dependence of the mixing-length 
value on the mass, metallicity, level of activity, and the evolu¬ 
tionary phase of the star is still under study and that no definitive 
conclusion has been reached (see, among many. [Clausen et ahl 


12009 ■ Deheuvels_^^iche| 20H b Bonaca_e^_af[ 2012LMathur 


et al. 12012t Tanner et al. 20141) . In V15 we studied the impact 
on age estimates due to a change in the mixing-length value 
by adopting synthetic grids with non-solar mixing-length values, 
which did not vary inside a given grid regardless of the mass and 
metallicity of the star. Since in the present work, we consider 
systems of stars with different masses, hence in different evolu¬ 
tionary stages, we feel that the approach of V15 might not be ap¬ 
propriate. Besides the effect of a change in the mixing length of 
the sampling grid, a differential effect of the possible evolution 
of the mixing-length value with the mass of the stars could be 
important. Nevertheless, we performed a numerical experiment, 
by sampling from grids with <r m | ± 0.24 with respect to our solar- 
calibrated value (o' m | = 1.74), which showed a bias ranging from 
about +4.0% to +6.0%, the lowest values for mass ratio q near 
1.0. These simulations suggest that the bias due to the different 
choices of the mixing-length values is certainly important but 
not dominant. For a more rigorous investigation, better knowl- 
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Fig. 4. Left', lcr envelope of the age-relative error in dependence on the mass of the primary star (solid line), compared with the same quantity 
computed only with the data of the primary star (dashed line). Right: same as the left panel , but as a function of relative age of the primary star. 


edge of the dependence of the external convection efficiency on 
stellar parameters, such as mass, chemical composition, evolu¬ 
tionary phase, and stellar activity, is mandatory. 

The observables adopted here in the age reconstruction are 
different from those in V15, which made use of classical (i.e. the 
effective temperature of stars and their metallicity [Fe/H]) and 
asteroseismic (i.e. the average large frequency spacing Av and 
the frequency of maximum oscillation power v max ) observable 
constraints. We therefore expect that the impact of the uncer¬ 
tainty sources is different from what is found in our previous 
work. In particular, the availability of precise stellar masses is 
paramount since it severely restricts the number of stellar mod¬ 
els that can actually contribute to the recovery procedure. 

4. 1. Convective core overshooting 

As in V15 we parametrised the extension of the extra¬ 
mixing region beyond the canonical border, as defined by the 
Schwarzschild criterion, in terms of the pressure scale height H p : 
Z ov = pil p , where p is a free parameter. To quantify the impact of 
taking the convective core overshooting into account, we com¬ 
puted - only for models more massive than 1.1 M e - two addi¬ 
tional grids with values of p = 0.2 and 0.4, the latter representing 
a pos sible maximum value (see e.g. the discussion in Valle et alJ 
l2009t) . We then extended these two grids of stellar models at 
masses lower than 1.1 M 0 with the standard one, which neglect 
core overshooting. 

From these non-standard grids we built two synthetic data 
sets (for p = 0.2 and p = 0.4) by sampling N = 50000 ar¬ 
tificial binary systems each and adding a Gaussian noise. The 
age estimate is then performed by adopting our standard grid of 
models (without overshooting) in the recovery procedure. The 
results are shown in Tables 0 and [4] and in Figs. [5] and [6] To eval¬ 
uate the magnitude of the obtained biases, they can be compared 
with the standard lcr envelope due to observational uncertain¬ 
ties in Table Q] It turns out that the median bias due to the mild 
(i.e. p = 0.2) core overshooting scenario - computed as func¬ 
tion of the mass ratio q - is about 50% of the half width of the 


standard envelope uncertainties. The corresponding bias for the 
strong (J3 = 0.4) core overshooting scenario is about 120% of 
the standard envelope half width. In both cases the relative im¬ 
portance of the bias is generally greater for low values of q\ for 
the strong overshooting scenario, in particular, the bias reaches 
values of about 160% of the standard envelope half width at q = 
0.5, while it is only 60% at q = 1.0. This trend is expected since 
part of the systems with q near 1.0 are composed of stars with 
both masses lower than 1.1 M Q , for which core overshooting is 
not taken into account so that the discussed effect is partially 
masked. 

Figure [6] shows that the dominant effect for the bias is due to 
the mass of the primary star. As an example, the mean bias for 
mild overshooting scenario (left panel in the figure) is -0.2% for 
a primary star that is less massive than 1.1 M e , while it is -3.7% 
for a massive primary coupled to a light secondary star. The shift 
due to having a massive secondary star is -6.0%, so it only ac¬ 
counts for an additional -2.3% change. The contribution of the 
secondary star is even less important in the strong overshooting 
scenario (right panel in the figure), when it accounts for about 
one-fifth of the primary star one. 

The results presented in this section have particular impor¬ 
tance since binary systems are often adopted to calibrate the ef¬ 
ficiency of the convective co r e overshooting by isochrone fine 
tunin g (see e.g. IClareHl2007fc lLacv et aD 120081 : IClausen et alJ 
I2010T) . Although our results do not directly allow estimation of 
the errors in the recovered best convective core-overshooting pa¬ 
rameter that is consistent with the observations, we found that 
the bias induced by a mild convective core overshooting sce¬ 
nario on age estimate is about one-half of the lcr expected error 
in age estimate. This implies a difficulty unambiguously identi¬ 
fying the overshooting effect, which can be masked by random 
fluctuations. 

The present results suggest that a mild overshooting scenario 
is hardly distinguishable from a no overshooting one. This result 
agrees with the finding by lClaretl d2007l) in the lower mass range 
considered in that paper. Although a statistical calibration - ob¬ 
tained from several systems - can be meaningful, the large errors 
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Table 3. SCEPtER bias on the recovered age due to the change in the stellar code input. Median age relative errors (/?!?„ and p s 50 ) are reported as a 
function of the mass of the primary and secondary star. Values are expressed as percent. 
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Table 4. SCEPtER bias on the recovered age due to the change in the stellar code input. Median age relative errors are reported as a function of 
the mass ratio of the binary system. Values are expressed as percent. 
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Fig. 5. Left, standard lcr envelope of the binary system age relative error in dependence on the mass of the primary star (solid black), compared 
with the same quantity obtained by sampling from grids with different core -overshooting efficiency £ = 0.2 (dashed red) and £ = 0.4 (dot dashed 
green). Middle: same as the left panel, but in dependence on the mass of the secondary star. Right: same as the left panel, but in dependence on the 
system mass ratio q. 


that propagate into the final calibrated overshooting parameter 
shed some doubt on the calibration of single systems. Owing to 
the widespread use of binary systems for calibration purposes, 
we feel that further research is needed to statistically quantify 
the errors in the calibrated parameters. 

4.2. Initial helium abundance 

We quantified the impact on the age estimate of the current un¬ 
certainty in the initial helium abundance following V15. We 


computed two additional grids of stellar models with the same 
metallicity values Z as in the standard grid, but by changing the 
helium-to-metal enrichment ratio AY /AZ to values of 1 and 3. 
Then, we built two synthetic datasets, each of N = 50 000 ar¬ 
tificial binary systems, by sampling the objects from these two 
non-standard grids and adding Gaussian noise. The age of the 
objects is reconstructed using the SCEPtER pipeline relying on 
the standard grid with AY/AZ = 2. 

The results are presented in Tables [3] and [4] and in Figs.[7]and 
[8] The mean impact of the initial helium abundance uncertainty 
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Fig. 6. Left : relative age bias due to the mild core overshooting as a function of the masses of the binary system. Artificial stars are sampled from 
the grid with mild core overshooting (J3 = 0.2) and their age is estimated on the standard grid. Right: same as the left panel , but for the strong 
overshooting scenario (0 = 0.4). 
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Fig. 7. Left: standard lcr envelope of the binary system age relative error in dependence on the mass of the primary star (solid black), compared 
with the same quantity obtained by sampling from the grid with a different initial helium abundance computed assuming AY/AZ = 1 (dashed red) 
and AY/AZ = 3 (dot dashed green). Middle: same as the left panel , but in dependence on the mass of the secondary star. Right: same as the left 
panel, but in dependence on the system mass ratio q. 


is higher than for the convective core overshooting. The median 
bias for AY/AZ = 1 - computed as a function of the mass ratio q 
(Table 0} - is about the -140% of the half-width of the standard 
lcr envelope due to observational uncertainties reported in Ta- 
bleQ] The corresponding bias for AY/AZ = 3 is about 160% of 
the standard envelope half width in TableQ] Figure[8]shows that 
both the changes in the bias due to the primary and secondary 
stars are modest, so, in the examined mass range, the initial he¬ 
lium induced bias can be considered almost constant at about 
±10% for a change in AY/AZ by ±1. 

The results presented in this section on the initial helium un¬ 
certainty impact are very different from those reported in V15, 
where this uncertainty source have been found to be nearly negli¬ 
gible. This striking difference results from the observational con¬ 
straints adopted in the estimation. The analysis in V15 in fact 
showed that the little influence of the helium content on age es¬ 
timates came from two opposite effects that nearly cancel each 
other. The first obvious effect was due to the impact of the helium 
content on the evolutionary time scale, so that helium-rich mod¬ 


els evolve faster. A second effect was due to the placement on 
the standard recovery grid of the non-standard helium models; 
helium-rich models mimicked more massive standard models, 
leading to a mass overestimation bias and therefore to reduced 
age estimates. The two effects compensate for each other. 

This balancing effect cannot occur here since the mass is 
adopted as an observational constraint, and this leads to the 
strong influence of the initial helium content on the final age 
estimates. 

4.3. Element microscopic diffusion 

We evaluated the impact on the age estimates of neglecting mi¬ 
croscopic diffusion in the stellar models adopted in the recovery 
procedure following, as in V15, a different approach with respect 
to the previous cases. Here we preferred to build the synthetic 
data set by sampling N = 50 000 artificial binary systems from 
stellar models that take diffusion into account and by adding 
Gaussian noise. We then reconstructed their age by means of the 
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Fig. 8. Left: relative-age bias due to adopting a low initial helium abundance as a function of the masses of the binary system. Artificial stars are 
sampled from the grid with AF/AZ = 1, and their age is estimated on the standard grid (AF/AZ = 2). Right: same as the left panel, but sampling 
from a grid with AF/AZ = 3. 



Fig. 9. Left : standard lcr envelope of the binary-system age -relative error in dependence on the mass of the primary star (solid black), compared 
with the same quantity obtained by sampling from the standard grid and reconstruction on a grid of stellar models computed without microscopic 
diffusion (dashed red). Middle: same as the left panel, but in dependence on the mass of the secondary star. Right: same as the left panel, but in 
dependence on the system's mass ratio q. 


SCEPtER pipeline but adopting the non-standard grid of models, 
which neglect diffusion. 

The results are presented in Tables [3 and [4] and in Figs. [9] 
and [TO] The bias due to the microscopic diffusion has a mean 
value of 3.6% (see Tabled, it is similar in absolute value to the 
one due to mild overshooting, and it ranges from about one-half 
to one-third of the bias due to initial helium change. Figure [TO] 
confirms an expected result that the diffusion is most important 
in systems composed of two low-mass stars. For primary and 
secondary masses lower than 1.1 M e , the bias in age estimates 
is 5.6%, while it drops to 3.7% for more massive primary stars. 
The trend can be understood since the evolutionary time scale of 
these stars is faster than for the diffusion processes, which are 
therefore quite inefficient (see left panel in Fig. [9}. 

As already discussed in Sect. 14.21 this result confirms that 
the bias in age grid estimates, owing to the different input in 
the evolutionary stellar codes, and strongly depends on the as¬ 
sumed observational constraints. In fact V15 has shown that - if 
adopting asteroseismic constraints - the microscopic diffusion- 


induced bias dominated the other examined biases, while it is not 
the case here. 

5. The coevality problem: a statistical approach 

In the literature there are several cases of real eclipsing binary 
systems whose components are estimated to be non-coeval, be- 
cause they cannot be fitted s imultaneously by a single isochrone 
(e.g. IClausen et all 120091 : IVos et all 120121 : ISandauist et al.l 
120131) . Such an occurrence is often used to claim that the cur¬ 
rent generation of stellar models present some weaknesses and 
that additional physical processes should be included or re¬ 
calibrated. Given its considerable implications, it is worth dis¬ 
cussing whether the estimates of non-coevality amongst the two 
components of real binary systems are statistically reliable or 
are simply a fluctuation. In this regard it is of particular interest 
to quantify the expected difference in the estimated ages of two 
genuine coeval stars caused simply by the observational uncer¬ 
tainties. To do that, we analysed our previously described syn¬ 
thetic datasets of coeval binary stars. 
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Fig. 10. Relative-age bias due to neglecting elements diffusion as a func¬ 
tion of the masses of the binary system. Artificial stars are sampled from 
the standard grid, and their ages are estimated on a grid computed with¬ 
out microscopic diffusion. 


Fig. 11. Critical values VV 0.95 of the statistic W for the expected differ¬ 
ences in the ages of the two stars, only due to the observational uncer¬ 
tainties (see text). 


As detailed in Sect. [2] the artificial binary systems sampled 
from the grid of stellar models are subject to random perturba¬ 
tions to account for the observational errors. Therefore the ages 
of the single stellar members, which are equal by construction 
in the sampling stage, are generally estimated as different. It is 
interesting to estimate how large the expected differences are in 
age owing only to these random perturbations, since this effect 
also interests real world observations. 

A detailed analysis of the problem has to deal with many 
technical aspect^ and it is therefore outside the aims of the 
present work. A devoted paper addresses the statistical aspects of 
this question presenting more detailed results (Valle et al. 2015, 
in prep.). Nevertheless, we present some basic findings here that 
were obtained with the standard sampling described in Sect. [2] 

We define A\ and A 2 as the estimated ages of the two mem¬ 
bers, with Ai > At. We focused our attention on the statistics 
W = (Ai - A 2 VA 1 , which then ranges from 0, when the stars 
are correctly estimated coeval, and 1 , when one star is estimated 
to be much younger than the other. To develop a statistical test 
based on W we have to estimate how large W can become only 
because of the random fluctuations in age estimates Ai and A 2 . 
Once the distribution of W - by means of a Monte Carlo simu¬ 
lation has been empirically estimated, it is possible to choose a 
critical value by identifying the range of values of W that are too 
extreme to be compatible with the coevality hypothesis. Usually 
the critical value is chosen as the 95th or the 99th quantile of the 
distribution of the statistics under consideration. The choice of 
the critical value to adopt (in the following, we refer to this as 
Wi-q.) defines the "level" a of the test (a thus being the prob¬ 
ability that an observed value of W is larger than W The 
set of values higher than the critical values are said to lie in the 
"rejection region". 

We present the rejection regions at level a = 0.05, hence the 
critical values W0.95 identifying them, of the null hypothesis that 
the stars are coeval. Reconstructed values of W higher than the 


8 Such as the choice of the sampling scheme for the binary systems, 
the evaluation of the precision and the accuracy on the estimated critical 
values, and of the sample size needed to attain a given level of precision. 


critical value Wo .95 lead to the rejection of the null hypothesis, 
implying that standard stellar models cannot account for the co¬ 
evality of the stars with the assumed input or parameters. 

A first obvious complication comes from the value of W 0.95 
depending on the mass of the two stars, on their metallicity, and 
on the (observationally unknown) relative age. Here we focus on 
the impact of the binary star masses. 

To perform the critical-value estimation, we generated a sam¬ 
ple of N = 50 000 artificial binary systems, perturbed their ob¬ 
servables as explained in Sect. [2] and then independently esti¬ 
mated the ages of the two stars. From these estimates, we com¬ 
puted the statistics W. Then we binned the W values according 
to the primary and secondary stellar masses, as for constructing 
the 2D envelope in Sect. 13. II In each of these bins, we computed 
the empirical W 95th quantile, which approximates the required 
critical value. A detailed discussion of the dependence of such 
a critical value on the actual uncertainties affecting the observa¬ 
tional constraints and on the evolutionary phase of the two binary 
members will be presented in a forthcoming paper (Valle et al. 
2015, in prep.). 

The results of the analysis are displayed in Fig. |TT] The fig¬ 
ure shows that genuine coeval stars can be reconstructed as non¬ 
coeval with a sizeable relative age difference only because of 
the current uncertainty in the observational constraints. While 
the critical value W 0.95 is about 0.25 for systems of nearly equal 
masses, it can be greater than 0.6 for unbalanced binary masses. 

This result should be carefully considered when determining 
the stellar ages of a binary system. Thus, a determination of two 
discrepant stellar ages, which leads to a value of W < W 0 . 95 , does 
not allow a statistically grounded rejection of the hypothesis that 
the stars are coeval since the difference might only be due to a 
random fluctuation in the observational constraints. 

As discussed in Sect. 14.11 one should be aware of this ef¬ 
fect whenever the extent of convective core overshooting is cal¬ 
ibrated on eclipsing binary by requiring that both stellar compo¬ 
nents are simultaneously fitted by a single isochrone, i.e. by im¬ 
posing their coevality. If the ages estimated by means of stellar 
models without core overshooting give a value of W < Wo. 95 , one 
cannot conclude on statistical grounds that these models are not 
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able to simultaneously fit both stars and that a convective core 
overshooting must be taken into account. Therefore, the reliabil¬ 
ity of the calibration method of the convective core overshooting 
consisting in the isochrones fine tuning to the observation has to 
be cautiously evaluated. 

In contrast, when standard stellar models provide ages of 
the binary components such that W > W 0 . 95 , the possibility that 
non-coevality is the result of observational uncertainties should 
be considered as negligible. As an example, binary systems in 
which at least one star is suspected of supporting a strong sur¬ 
face activity, generally show a large discrepancy among the ages 
of th e two components, such a s V636 Cent and EF Aqr ( Clausen 
et al. [20091 : 1Vos et alJl2012fl . In this system, the active secondary 
stars are estimated to be much older than the primary, far above 
the limit due to random fluctuations. The coevality is recovered 
adopting a significantly lower mixing-length value for the sec¬ 
ondary stars. This is an expected behaviour since it is recognised 
that in active stars the dynamo-induced magnetic fields can sup¬ 
press convection and produce starspots, causing differences in 
temperature and radii with respect to standard stellar models, 
which can be mimicked by a lower mixing- length value (Gabriel 
119691 : ICox et alJfl98ll : IClausen et al.lll999h . It is, however, clear 
that further research is mandatory to determine the expected er¬ 
rors in the calibrated mixing length in a sound statistical way. 

6. Conclusions 

We performed a detailed theoretical investigation of the age es¬ 
timation of detached double-lined eclipsing binaries by means 
of a maximum likelihood grid-based technique. We analysed 
the impact of the current uncertainty on several input of stel¬ 
lar model computations, such as the chemical composition, the 
efficiency of convective core overshooting, and the efficiency of 
microscopic processes. 

We adopted the g rid-based pipeline SCEPtE R, which is ex - 
tensively described in i Valle et ali (120141 ) an d I Valle et al.l d20l5h . 
As observational constraints we assumed the stellar effective 
temperature, the metallicity [Fe/H], the mass, and the radius of 
the stars. The grid of stellar models, computed for the evolution¬ 
ary phases from ZAMS to the central hydrogen depletion covers 
the mass range [0.8; 1.6] M Q . 

We computed the statistical errors arising from the uncer¬ 
tainties in observational quantities. We found an overall rela¬ 
tive uncertainty of about +7% in age estimates of the system, 
which is nearly independent of the masses of the stars and of 
the mass ratio of the binary systems. However, as described in 
IValle et al. ( 2015 ). the relative error in the age estimates sen¬ 
sitively depends on the evolutionary phase and becomes larger 
for models near those ZAMS for which it can reach values of 
about 90% for the upper boundary, with an envelope half width 
of about 50%. The large uncertainty and the bias toward higher 
ages found near the ZAMS are mainly an edge effect distortion, 
since the estimated age cannot be negative. The relative error en¬ 
velope quickly shrinks as the stars evolve from the ZAMS, and 
for primary stars of relative age 0.3, the half width of the error 
envelope is about 15%. 

We studied the impact on the age estimates of different 
choices of the sampling algorithms, of the correlation between 
the effective temperatures, the masses, the radii, and [Fe/H] of 
the two binary components, and of the method of estimating the 
joint age of the stars. It resulted that the first source of uncer¬ 
tainty plays a minor role since it accounts for a variation in the 
relative error half width of about 1%. The neglect of correlations 
between the observables of the two stars was shown to reduce 


the age error envelope width by about one-fifth. The age esti¬ 
mates of the binary system obtained by averaging the ages of the 
two stars computed independently - without explicitly assum¬ 
ing coevality - lead to negligible differences only for stars of 
nearly equal masses. Whenever an unbalanced system is consid¬ 
ered (Mi >1.1 M 0 and Mo <1.1 M 0 ), the envelope width of the 
age relative errors for the average of independent age estimates 
is larger by a factor of two, but it can be as high as a factor of 
five for systems with q ~ 0.5. 

We evaluated the systematic biases on age estimates owing 
to the uncertainties in initial helium content, in the convective 
core overshooting, and in the microscopic diffusion efficiency. 
We found that the bias due to a mild (J3 = 0.2) and strong (J3 
= 0.4) convective core overshooting scenarios are about 50% to 
120% of the half width of the standard age relative error envelope 
due only to the observational uncertainties. The bias due to the 
initial helium content was explored assuming an uncertainty of 
±1 on the helium-to-metal enrichment ratio AT/AZ. The effect 
is about -140% and 160% of the lcr random error, so slightly 
more than that of the strong overshooting scenario. Finally, the 
bias due to the neglect of the element microscopic diffusion was 
about 60% of the half width of the standard age-relative error 
envelope. 

The comparison of these results to those reported in Valle 
et al. d2015( 7. which were computed by adopting asteroseismic 
constraints and without the knowledge of stellar mass and radius, 
showed relevant differences. In fact. lValle et al.l d2015h showed 
that the helium-induced bias on age estimates was negligible, 
while the bias due to the microscopic diffusion was dominant. 
These differences arise from the corre spon ding bias on mass es¬ 
timates that could occur in lValle et al.l (l2015h . which led to some 
compensations for the age estimates. These mechanisms cannot 
occur in the present study, since the stellar mass is assumed as 
an observational constraint. An important point is therefore that 
the bias in the grid-based age estimates - due to adopting dif¬ 
ferent input in a stellar evolutionary code - strongly depends on 
the assumed observational constraints and should be evaluated 
case-by-case whenever a new grid is adopted for estimates. 

We also introduced a statistical test on the expected differ¬ 
ences in the estimated ages of two genuine coeval binary com¬ 
ponents due simply to the observational uncertainties affecting 
the observables used in the recovery procedure. We found that 
the relative difference between the reconstructed ages of two co¬ 
eval binary members depends on the mass ratio q. For systems 
with q » 1, the relative difference in the inferred ages can be 
about 20%, while it is higher than 60% for system with q < 0.8. 
This result should be carefully taken into account before using 
the apparent non-coevality of stars in binary systems - i.e. the 
inability to find a single isochrone that fits both components - to 
claim the weakness of current generation of stellar models and to 
calibrate the efficiency of some poorly known physical process, 
such as convective core overshooting, by isochrone fine tuning 
with the observations. 
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Appendix A: Synthetic dataset: sampling strategy 

In constructing the sample adopted in the error estimation, we 
did not try to match any observed distribution of the mass of the 
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Fig. A.2. Joint density of mass ratio q and primary relative age in the 
sample of binary stars. The solid line displays the LOESS-smoothed 
trend of relative age versus mass ratio q. 


binary stars or of the mass ratio q. Since we primarily aimed 
to investigate the performances of grid-based age estimates of 
binary stars, we paid more attention when sampling the whole 
range in mass, metallicity, and evolutionary phases covered by 
the available grid of models. 

In detail, the datasets of artificial stars are sampled by adopt¬ 
ing the following scheme. First, a star is randomly sampled from 
the grid. No priors on mass, age, and metallicity are assumed in 
the sampling procedure. Then, a second star is coupled to the 
first one with two requirements: it must share with the first star 
both the same initial [Fe/H] and the age, with a tolerance of 10 
Myr. The couple of stars is re-ordered to have the most luminous 
star as a first member. 

To show which models are selected as primary and secondary 
stars, we estimated the joint density of mass and relative age for 
primary and secondary starfl From the joint density it is possi¬ 
ble to verify if there are ranges of mass or relative age that are 
preferably sampled. The results are displayed in Figure lATl It is 
apparent that the density functions for the primary and secondary 
stars are different and that they are both not uniform. In partic¬ 
ular, the sample of primary stars is biased towards high masses 
and high relative ages since these models have higher intrinsic 
luminosity. The distribution of secondary stars is more diffuse, 
since several young low-mass models are present in the sam¬ 
ple. Figure IAT21 shows the joint density of the mass ratio q and 
the primary relative age. The solid line in the figure is a LOESS 
smootheo of the relative age versus q. It is apparent that for bi¬ 
nary systems with near equal masses, the sampling returns more 
evolved stars. This bias comes from later evolutionary phases 
needing more points to adequately follow the rapid evolution. 
The median time step in this grid region is about 13 Myr. Once 
a model has been sampled in this grid region, several points in 
the same evolutionary track will pass the constraint on maximum 
age differences of 10 Myr from the first sampled model. There- 


9 The joint density was obtained by a bi-dimensional kernel density 
estimation using the function kde2d in the R library MASS. Details on 
the technique can be found in lYenables & Riplevl d2002l) . 

10 A LOESS (LOcal regrESSion) smoother is a non-parametric, locally 
weighted polynomial regression technique that is o ften used to show 
the underlying trend of s cattered data (see e.g. Feigelson & Babul l201 j 
Venables & Rinlev 2002). 


fore there is a high probability that a point in the same track is 
coupled to the first selected stars, leading to a mass ratio of 1.0. 
This phenomenon is not important in the reconstruction phase, 
since all these points have basically the same age, as discussed 
in V15. 

The effect of the sampling could be seen in Table [Q which 
shows a small shrink in the age relative error envelope at q 
greater than 0.9. This is a direct consequence of the relative er¬ 
ror envelope narrowing at high relative age. To show that this 
artificial trend could be removed with different sampling strate¬ 
gies, in Table [T] we also present two additional series of results. 
In fact, we first explored the impact of a different sampling. For 
this test, we sampled the set of primary stars, then we coupled 
each of those objects to the secondary star by sampling only 
from the models respecting the constraint in metallicity and age, 
which are also less massive than the primary object. The results 
are in the rows labelled “mass ratio q (alternative sampling)” in 
Table □ As a second test, to see the impact of the ad-hoc re¬ 
moval of the trend of primary relative ages versus q shown in 
Fig. IA.21 we checked an alternative approach that rejects some 
objects in the subset q > 0.9, based on the standard method. In 
fact, since this is the region that is more populated, it was possi¬ 
ble to draw from it a sample of size equal to that of the adjacent 
subset 0.8 < q < 0.9, with the requirement that the new sam¬ 
ple’s primary relative age matches the distribution of the subset 
0-8 < q < 0.9. The corresponding results are in the rows "mass 
ratio q (rejection step)" in Table Q] It appears that the shrinkage 
in the upper q region is reduced in both two cases. 

In contrast, the shrinkage of the error envelope at low mass 
ratio q (see Table [I]) is due to an edge effect since there are only 
models in this region with mass 1.6 M e coupled with 0.8 M Q 
models. In fact the lower left-hand region ofFig. lALZl is not popu¬ 
lated because to produce a coeval combination, the more massive 
model cannot be too young. In fact, in this case the secondary is 
still in the pre-main sequence phase, which is not considered in 
the grid. Figure IA.2I shows that the minimum relative age for 
these massive models is about 0.2. Therefore the most critical 
region for age estimation - that of low relative age - is avoided, 
and the estimate relative error envelope is narrower. 
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